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PRODUCTION OF COMPLEX CARBOHYDRATES 



CLAIM OF PRIORITY 
This application claims priority under 35 U.S.C. 1 19(e) from U.S. 
5 Provisional Application Serial No. 60/134,756, filed May 18, 1999, which 
application is incorporated herein by reference. 

GOVERNMENTAL RIGHTS 
The United States Government retains certain rights in this invention. 
10 Financial support was provided by the National Institute of Allergy and Infectious 
Diseases under Grant Number A124016 and from the NIH National Center for 
Research Resources under Grants Number RR01614 and RR041 12, 



FIELD OF THE INVENTION 
15 This invention relates to a method for the production of complex 

carbohydrates on an LPS backbone structure in Gram-negative bacteria. 



O BACKGROUND OF THE INVENTION 

f? Complex carbohydrates occur in nature and are involved in a wide array of 

Cfl 20 biological functions, including viral, bacterial and fungal pathogenesis, cell-to-cell 

q and intracellular recognition, binding of hormones and pathogens to cell-surface 

receptors and in antigen-antibody recognition. The term "complex carbohydrates" 
embraces a wide array of chemical compounds having the general formula (CH 2 0) n 
where the monomer unit is selected from any of thousands of naturally occurring or 
25 synthetic monomers, including, but not limited to, glucose, galactose, mannose, 

fucose and sialic acid. Saccharides may have additional constituents such as amino, 
sulfate or phosphate groups, in addition to the carbon-hydrogen-oxygen core. The 
polymer consisting of two to ten saccharide units is termed an oligosaccharide (OS) 
and that consisting of more than ten saccharide units is termed a polysaccharide 
30 (PS). These monosaccharide building blocks can be linked in at least 10 different 
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ways, leading to an astronomical number of different combinations and 
permutations. It is found that strains within species and even tissue within an 
organism differ in complex carbohydrate structure. This high degree of variability, 
the highly specific composition of naturally occurring complex carbohydrates and 
5 the wide range of biological roles make these compounds especially significant. 

Gram-negative bacteria contain complex carbohydrates, which are linked to 
lipids to form lipooligosaccharides (LOS) or lipopolysaccharides (LPS.) The 
immunogenicity of the LOS and LPS resides in the carbohydrate moiety, while 
pathogenicity resides in the lipid moiety. For this reason, OS and PS are useful as 
10 vaccines against Gram-negative pathogens and for identification of gram-negative 
bacteria. 

United States Patent Application 5,736,533 discloses oligosaccharides useful 
as therapeutic agents against pathogens that are the causative agents of respiratory 
infections. It is believed that pathogenic bacteria are able to colonize tissue by 
15 binding to carbohydrates on the surface of the tissue and that providing an excess of 
specific soluble oligosaccharides can result in competitive inhibition of bacterial 
colonization. 

OS and PS from LPS and LOS can be produced by growing the specific 
bacterial pathogen in culture, with subsequent cleavage of the lipid moiety and 

20 purification. However, most pathogenic bacteria are fastidious in their growth 
requirements and slow growing, making this mode of production impractical. For 
example, Haemophilus influenzae is known to require a carbon dioxide atmosphere 
and brain/heart extract for growth. Helicobacter pylori grows very poorly in broth 
cultures required for OS and PS production. In addition, many of these bacterial 

25 pathogens (for example, Neisseria meningitidis) can be dangerous to grow in large 
volumes because of the risk of aerosol and possible infection spread. The ability to 
produce the OS and PS structures of fastidious bacterial pathogens in bacterial 
strains such as Escherichia coli and Salmonella Minnesota which grow rapidly to 
high density offers a rapid way to produce these OS and PS from fastidious bacterial 

30 pathogens. 
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Eucaryotic proteins and peptides frequently have carbohydrate moieties on 
their surfaces, which act as specific binding sites for hormones, which are also 
glycosylated, that is, have complex carbohydrates linked to the peptide structure. 
Moreover, in addition to the recognition role, carbohydrates are necessary to the 
5 proper three-dimensional folding of polypeptides into functional glycoproteins. 
Bacteria do not glycosylate peptides and proteins efficiently or in a manner 
equivalent to that of eucaryotes. For that reason, although bacteria are widely used 
as production cells for growing eucaryotic peptides and proteins, such useful human 
glycopeptides such as erythropoetin are grown in mammalian cells. United States 

10 Patent Number 4,703,008 discloses a method for the production of erythropoietin, in 
which cells such as Chinese hamster ovary cells are transfected with the DNA 
coding for the hormone and grown under a carbon dioxide atmosphere in complex 
medium. The resulting hormone is sufficiently similar to the naturally occurring 
hormone to be effective as a therapeutic for human use. 

15 An additional utility for isolated, cell-specific carbohydrates is for 

competitive inhibition of disease agents in which infection is reliant on surface- 
recognition glycosylated proteins. For example, the human immunodeficiency virus 
is known to bind to the surface receptor on T-4 lymphocytes. If an excess of free T- 
4 receptor carbohydrate is present in the bodily fluids of the patient, the virus will 

20 bind to the free carbohydrate and is effectively prevented from infecting the T-4 
lymphocyte. 

Competitive inhibition of binding of antibodies to cell surfaces by 
administration of cell-recognition molecules may have therapeutic potential in the 
treatment of autoimmune diseases such as lupus erythematosus, multiple sclerosis 
25 and rheumatoid arthritis. Such moledules may bind to the cell receptor, blocking the 
binding of the automimmunie antibodies which cause the degeneration seen in such 
disease states. 

United States Patent Number 4,745, 051 discloses a method for expressing 
DNA in an insect cell, a method that has practical application for the production of 
30 glycosylated peptides and proteins. However, the glycosylation resulting is that 
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native to the insect, consisting of higher levels of mannose than are typical of 
mammalian cells. 

Practical production of peptides and polypeptides in bacterial production 
cells is well established Chemical and enzymatic means for glycosylating peptides 
5 and proteins are well known in the art. For example, United States Patent 5,370,872 
discloses a method for coupling PS through a carboxyl or hydroxyl group to a 
protein. Classic organic syntheses of complex carbohydrates have long been 
known, but with limited practical application. In addition to the difficulties inherent 
in the complexity of the glycopolymer molecule, many glycosidic bonds are labile 
1 0 and must be protected and deprotected during chemical synthesis, adding to the 
difficulty of synthesis and reducing the yield of product. 

Because of the drawbacks of organic synthesis, enzymatic synthesis has been 
devised. It is known that glycosylation proceeds by the step-wise addition of 
monomers through the action of such enzymes as glycotransferases. The reaction 
15 products can be further modified by lyases, acetylases, sulftases, phosphorylases, 
kinases, epimerases, methylases, transferases and the like. United States Patent 
Number 5,308,460 discloses such a step-wise synthesis on an immobilized matrix. 

A need remains for a more efficient and practical method for the production 
of complex carbohydrates, and glycoproteins and glycopeptides containing complex 
20 carbohydrates specific to a species or tissue. 

SUMMARY OF THE INVENTION 
The present invention is directed to the production of complex carbohydrates 
in a production cell. It is here disclosed that certain bacteria, such as Escherichia 
25 coli Strain K-12, have a core liposaccharide with a terminal heptose. A suitable 
production cell also contains an enzyme which catalyzes the transfer to the terminal 
heptose of an acceptor molecule, such as N-acetylglucosamine, to form a "scaffold" 
upon which glycotransferases add other saccharide monomers to form complex 
carbohydrates. If an otherwise suitable production cell lacks such an enzyme, the 
30 DNA encoding the gene rfe (UDP-GlcNAc:Undecaprenol GlcNAc-1 phosphate 
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transferase) of Haemophilus influenzae may be inserted into the production cell. 
Preferably, production of rfe is enhanced by the presence of the gene products of the 
gene IsgG. By inserting genes encoding glycotransferases into the production cell, 
the complex carbohydrates specific to bacteria such as Haemophilus influenzae, 
5 Neisseria spp r Salmonella spp and Escherichia coli are produced. Mammalian 
complex carbohydrate such as polysialyl can also be produced. 

Accordingly, the invention provides a process for the production of a 
complex carbohydrate which comprises the steps of; (a) inoculating transformed 
production cells into a culture medium capable of supporting the growth of said 

10 production cells wherein said production cells are prepared by transforming bacteria 
comprising (i) a core lipid structure containing a terminal heptose molecule and (ii) 
an enzyme capable of adding an acceptor molecule to said heptose molecule by 
inserting an isolated DNA sequence encoding glycotransferase synthesizes a 
complex carbohydrate into said bacteria to yield transformed production cells; (b) 

15 allowing growth of said transformed production cells; and (c) recovering said 
complex carbohydrate from the culture medium. 

The invention also provides a process for the production of an 
oligosaccharide which comprises the steps of: (a) transforming gram-negative 
bacteria comprising (i) a core lipid structure containing a terminal heptose and (ii) 

20 an enzyme that adds a galactose molecule to said heptose wherein said transformed 
gram-negative bacteria are prepared by constructing a vector comprising an isolated 
DNA sequence coding for a glycotransferase that synthesizes an oligosaccharide; 
(b) inoculating said transformed gram-negative bacteria into a culture medium 
capable of supporting the growth of said transformed bacteria; (c) allowing growth 

25 of said inoculated gram-negative bacteria; and (d) recovering said oligosaccharide 
from the culture medium. 

Using methods disclosed in this application, a production cell suitable for the 
practical production of other complex carbohydrates can be identified. Such a 
suitable production cell will have an acceptor molecule specific to the carbohydrate 

30 to be synthesized, or a site that can be modified to add such a specific acceptor 
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molecule. The production cell will contain the initiating IsgG or IsgF to form the 
appropriate acceptor. DNA coding for the glycotransferases of other species, 
strains, tissues, hormones, receptors or other cell-surface carbohydrates can then be 
inserted into such a production cell, with the resultant production of 
5 oligosaccharides or polysaccharide specific to that species, strain, tissue, hormone, 
receptor or other cell-surface carbohydrate. The nucleotide sequences for the genes 
rfc and IsG are on file in the K influenzae Rd database available from TIGR 
(Bethesda, MD). Sequences for glycotransferases are available from the references 
herein disclosed. 

10 Also provided are methods for separating and purifying the product. 

The invention also provides a process for the production of a complex 
carbohydrate, comprising culturing production cells comprising a chimeric DNA 
sequence encoding a glycotransferase, so as to yield production cells comprising an 
altered level of complex carbohydrate, wherein the production cells are bacteria 

1 5 comprising a core lipid structure containing a terminal heptose molecule and 
encoding an enzyme capable of adding an acceptor molecule to the heptose 
molecule. The invention also provides a process further comprising recovering the 
complex carbohydrate. 

20 BRIEF DESCRIPTION OF THR DRAWINGS 

Figure 1 : The Isg region of Haemophilus influenzae DNA. 
(A) Diagram of the eight orfs. (B) Locations of m-Tn3(Cm) insertion sites (6). (C) 
Restriction maps of the EMBLOS-1 subclones that modified the E. coli JM 109 
LPS. 

25 Figure 2: SDS-PAGE of the LPS from E. coli strain JM 109 (designated 

pGEM) and the three chimeric strains, pGEMLOS-7, pGEMLOS-5, and 
pGEMLOS-4. The LPS range in molecular weight from -3.3 to 5.5 kDa. 

Figure 3: Proposed structures of the chimeric oligosaccharides. Only the 
complete E. coli K-12 core structure containing a fourth heptose on the terminus of 

30 the oligosaccharide branch undergoes modification. Additional saccharides 
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(designated R) are added to the 7-position of this heptose to form the chimeric 
oligosaccharides. 

DETAILED DESCRIPTION OF THE INVENTION 
5 The present invention provides a method by which the terminal heptose of 

the core structure of any gram bacterial species which contains the gene rfe (UDP- 
GlcNAc:Undecaprenol GlcNAc-1 phosphate transferase) (Alexander et al. (1994) J. 
Bacteriol., 176-7079-7084) can be modified so as to act as an acceptor for 
oligosaccharide synthesis. The rfe gene encodes for a protein which catalyzes the 

10 transfer of N-acetyl glucosamine (GlcNAc, an "acceptor molecule") onto the carrier 
lipid undecaprenol phosphate. The regulation of this gene can be controlled with a 
regulatory gene, IsgG, identified in Haemophilus influenzae. The increase in rfe 
expression caused by IsgG mediates the deposition of a GlcNAc residue on the 
terminal heptose of LPS and LOS from a variety of gram-negative bacterial species 

15 including £. coli, Salmonella minnesota and H. influenzae. This GlcNAC has been 
found to function as an acceptor molecule forming a scaffold for the sequential 
addition of saccharide monomers, under the direction of glycotransferases. For 
example, the galactosyltransferase gene, IsgF, results in the addition of a galactose 
to the GlcNAc. The gene sequence coding for the Isg glycotransferases of if. 

20 influenzae has been inserted into an Escherichia coli K-12 strain production cell, 
with the resultant production ofK influenzae-specific LOS epitopes in E. coli. 

Any production cell containing an initiating enzyme similar to rfe can add an 
appropriate acceptor to form the scaffold. The production will preferably also 
contain the regulatory gene IsgG. DNA coding for the glycotransferases of other 

25 species, strains, tissues, hormones, receptors or other cell-surface carbohydrates can 
then be inserted into such a production cell, with the resultant production of 
oligosaccharides or polysaccharidea specific to that species, strain, tissue, hormone, 
receptor or other cell-surface carbohydrate. 

30 Definitions : 
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Complex carbohydrates: any polymer of formula (CH 2 0) n , where n equals at least 
three monomer units, including polymers with additional substituents including but 
not limited to S0 4 , P0 4 , C0 4 , CH 3 , NH 4 , and such polymers linked to lipids, 
peptides and proteins. 

5 

Production cell: A production cell useful in this invention is defined as any 

bacterium which contains an LPS or LOS-saccharide inner core terminating 
in a molecule and containing an enzyme capable of adding an acceptor 
molecule to the terminal molecule to serve as a scaffold for elongation and 
10 which can be transformed with exogenous DNA coding for 

glycosyltransferases. Cells that are otherwise suitable but lack the proper 
acceptor molecule may be used as production cells if they are co-transformed 
with genes such as rfe and IsgG, to appropriately modify the LPS or LOS to 
function as an acceptor molecule for the formation of a scaffold. 

15 

Rib production cell: the preferred cell for production of H. influenzae type B- 
specific OS is preferably a gram-negative bacteriuim, most preferably E. coli K-12 
strain JM 109. 

20 Synthetic gene(s): the DNA coding for the enzyme or enzymes that synthesize the 
desired complex carbohydrate. These genes include those coding for 
glycotransferases, lyases, acetylases, sulfatases, phosphorylases, kinases, 
epimerases, methylases and the like. 

25 Rxample 1 . Selection of a Hib production cell 

Capsular strains of Haemophilus influenzae type b (Hib) are responsible for 
various invasive and bacteraemic infections in humans, including meningitis and 
pneumonia. The surface lipooligosaccharides (LOS) of Hib are known to be 
important factors in microbial virulence and pathogenesis. Structural studies of Hib 

30 LOS from wild-type and mutant strains have shown that the LOS contains a 
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conserved heptose trisaccharide core which can be extended with additional sugars 
on each heptose. Recently, a revised structure of the E. coli K- 12 core region was 
reported which also contains a heptose trisaccharide inner core and a fourth heptose 
present on the terminus of the main oligosaccharide branch: 
5 Gala 1 Hepa 1 

I 1 

6 7 

Hepal - 6Glcal - 2Glcal - +3Glcal - 3Hep<xl - 3Hepal - 5Kdo 

1 0 Previous work showed that the core region of E. coli transformed with 

synthetic enzyme genes could be elongated by the addition of saccharide monomers 
under the direction of H. influenzae genes to produce a modified E. coli LPS that 
was elongated by approximately five monomer units. It was thought that the 
monomers were added at each of the heptoses. (Kwaik et al., Molecular 

15 Microbiology, 5:2475-2480 (1990).) Therefore, efforts were made to transform an 
E. coli K-12 strain termed JM109 with K influenzae synthesis genes in an attempt 
to determine whether an LOS substantially identical to that of H. influenzae could 
be produced. 

Escherichia coli strains were routinely cultured at 37° C using LB agar or 
20 broth with appropriate antibiotics. Vectors used in these studies were previously 
described. (Kwaik et al. (1990)). LPS from E. coli JM 109 was prepared by the 
extraction procedure of Darveau and Hancock (Darveau et al. J. BacterioL 155(2), 
831-838 (1983).) The LPS was separated by SDS-PAGE in resolving gels 
containing 15 % acrylamide, and visualized by silver staining. 
25 To determine the structure of this chimeric LPS, a few milligrams of LPS 

from each sample were treated with anhydrous hydrazine at 37° C for 20 minutes, 
and then precipitated with cold acetone. 

In order to establish the chemical structure of the E. coli core and determine 
the E. coli acceptor residue, the LPS from E. coli strain JM 109 was partially 
30 characterized using composition analysis, linkage analysis, and mass spectrometry 
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as described in Example 6 below. 

2. Isolation of the LOS synthetic genes from Hib. 

Hib strain A-2 was originally isolated from the spinal fluid of a child with 
5 meningitis. Hib A-2 was grown on chocolate agar supplemented with amino acids 
and vitamins or brain heart infusion agar supplemented with 4% Fildes reagent 
(Difco Laboratories, Detroit) at 35° C in 5% C0 2 atmosphere. 

A gene cluster from Hib strain A-2 containing LOS synthesis genes (Isg) was 
previously cloned (Kwaik et al, (1990). The Isg loci are contained within a 7.4 kb 
10 DNA fragment, consisting of seven complete open reading frames (orfs). This 

region is one of several distinct loci also found in the genome sequence of Hib strain 
Rd which has been associated with lipopolysaccharide (LPS) biosynthesis. 

DNA from the Isg region of Hib strain A-2 was used to construct a genomic 
library in the lambda bacteriophage EMBL3 (Kwaik et al, 1990)). Twenty six 
15 phage clones were prepared which expressed Hib LOS oligosaccharide epitopes in 
E. coli strain LE392. The phage transformant designated EMBLOS- 1 produced a 
chimeric LPS with a 1 .4 kDa oligosaccharide added to the 4. 1 kDa LPS of E. coli 
LE392. Monoclonal antibody (MAb) 6E4, which recognizes two components in the 
Hib A2 LOS mixture, also recognized the novel 5.5 kDa component in the chimeric 
20 LPS, indicating some immunochemical similarity to Hib LOS. 

Exam ple 3. Transformation of the Hib production cell. 

Restriction fragments of EMBLOS-I were used to make a series of plasmids 
which modified E. coli strain JM 109 to give clones which produced a proposed 

25 chimeric series of higher mass LPS species. The transformants termed 

pGEMLOS-4, pGEMLOS-5, and pGEMLOS-7 generated modified or chimeric LPS 
of 5.5, 5.1 , and 4.5 kDa, respectively. All three apparently modified the 4.1 kDa 
LPS species from E. coli, although only the LPS from pGEMLOS-4 expressed the 
6E4 epitope. The LPS from strain pGEMLOS-5 was found to react positively with 

30 MAb 3F 1 1, suggesting the presence of terminal N-acetyllactosamine. The epitope 
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recognized by MAb 6E4 is also present in the LOS of H. influenzae nontypable 
strain 2019, as well as the LPS from Salmonella minnesota Re mutant Binding of 
this monoclonal antibody to H. influenzae LOS can be inhibited by Kdo and the Kdo 
trisaccharide from the Re mutant of S. minnesota. Because the 6E4 epitope has been 
5 associated with the core of Haemophilus LOS, it was originally proposed that the 
chimeric structures expressed in E. coli might arise from the addition of a 
Haemophilus core structure to an acceptor residue of the E. coli 4.1 kDa LPS 
species. 

The Hib production cell was transformed with the plasmid pGEM3Zf+ into 
10 which different DNA restriction fragments from H. influenzae strain A-2 Isg locus 
had been ligated (see Table 1 and Figure 1). The plasmid pGEMLOS-4 contained a 
7.4 kb bamhl-pstl fragment of DNA which contained all seven open reading frames 
(A-G) comprising the Isg locus. The plasmid pGEMLOS-5 contained a 5.5 kb Hind 
lll-pstl fragment of DNA comprising 5 open reading frames (C-G) of the Isg. The 
1 5 plasmid pGEMLOS-7 contained a 2.8 kb sphl-pstl fragment of DNA comprising 2 
open reading frames (F-G) of the Isg locus. The plasmid pGEM3zf+ without an 
insert was also transformed into strain JM 109. This strain and the LPS isolated 
from it were termed PGEM. 

20 Rxample 4. Isolation and Purification of Oligo saccharides. 

The LPS from PGEM (31 mg), pGEMLOS-4 (25 mg), pGEMLOS-5 (15 
mg), and pGEMLOS-7 (4.4 mg) was hydrolyzed in 1% acetic acid (2 mg LPS/ml) 
for 2 hours at 100° C. The hydrolysates were centrifuged at 5000g for 20 min at 4° 
C and the supernatants removed The pellets were washed with 2 ml of H 2 0 and 

25 centrifuged again (5000g, 20 min, 4° C). The supernatants and washings were 

pooled and lyophilized to give the oligosaccharide fractions. As a standard, 10 mg 
of LPS from Salmonella typhimurium TV 1 19 Ra mutant (Sigma, St. Louis) was 
treated in the same fashion. 

To prepare desalted oligosaccharide pools for ESI-MS analysis, small 

30 aliquots of the crude oligosaccharide fractions (<2 mg) were chromatographed on 
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two Bio-Select SEC 125-5 HPLC columns (Bio-Rad 5 Richmond, CA) connected in 
series, using 0.05 M pyridinium acetate (pH 5.2) at a flow rate of lml/min. A 
refractive index detector was used to monitor column effluent and chromatograms 
were recorded and stored with an integrator. 

5 For large scale separations, the oligosaccharide fractions from PGEM (10.2 

mg), pGEMLOS-4 (9.3 mg), and pGEMLOS-5 (7.0 mg) were dissolved in 0.3 ml of 
0.05 M pyridinium acetate buffer (pH 5.2) and centrifuge- filtered through a 0.45 gm 
Nylon-66 membrane. The PGEM and pGEMLOS-4 samples were applied to a 
single Bio-Gel P-4 column (1.6 x 84 cm, <400 mesh; Bio-Rad), and the 

10 pGEMLOS-5 sample was applied to two Bio-Gel P-4 columns connected in series 
(1.6 x 79 cm and 1.6 x 76.5 cm). The columns were equipped with water jackets 
maintained at 30° C. Upward elution at a flow rate of 10 ml/h was achieved with a 
P-l peristaltic pump (Pharmacia, Piscataway). Effluent was monitored with 
refractive index and fractions were collected at 10 minute intervals and evaporated 

15 to dryness in a concentrator. 

Exam ple 5. Deph osphorylation of Oligosaccharides. 

Oligosaccharide fractions were placed in 1.5 ml polypropylene tubes and 
treated with cold 48% aqueous hydrogen fluoride to make 5-10 /xg/ml solutions. 
20 Samples were kept for 18 hours at 4° C and then aqueous HF was evaporated. The 
dephosphorylated samples were then rechromatographed on two Bio-Select SEC 
125-5 HPLC columns connected in series using 0.05 M pyridinium acetate (pH 5.2). 

Example 6. Characterization of product. 
25 The reactivity with monoclonal antibodies raised to the naturally occurring 

Hib LOS, as shown in Example 2, indicated that the product had the same 
immunochemcial function as the naturally occurring Hib LOS. The samples were 
further analyzed by different techniques in order to determine structural identity to 
the desired complex carbohydrate. 

30 
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a. Monosacc haride Composition Analysis. 

Dephosphorylated oligosaccharide fractions were dissolved in 400 /x\ of 2 M 
trifluoroacetic acid and heated for 4 hours at 100° C. The hydrolysates were 
evaporated to dryness in a Speed- Vac concentrator, redissolved in 20 /zl H 2 0, and 
dried again. Hydrolysates were analyzed by high-performance anion exchange 
chromatography with pulsed amperometrie detection using a Dionex BioLC system 
(Dionex 3 Sunnyvale, CA) with a CarboPac PA1 column. 

h. Methyla tion Analysis. 

Linkage analysis was performed on dephosphorylated oligosaccharide 
fractions using the microscale method modified for use with powdered NaOH. 
Partially methylated alditol acetates were analyzed by GC/MS in the El and CI 
modes on a mass spectrometer. 

r, Li quid Secondary Ton Mass Spectrometry (L SIMS), 

LSIMS was performed using a mass spectrometer with a cesium ion source. 
Oligosaccharide samples (in 1 /A H 2 0)were added to 1 fA of glycerol/thioglycerol (1 : 
1) on a stainless steel probe tip. A Cs+ ion primary beam energy of 10 keV was 
used and the secondary sample ions were accelerated to 8 keV. Scans were taken in 
the negative-ion mode at 300 s/decade and recorded with an electrostatic recorder. 
The spectra were mass calibrated manually with Ultramark 1621 (PCR Research 
Chemicals, Inc., Gainesville, FL) to an accuracy of better than ± 0.2 Da. 

d. Flectrospray Ionization Mass Spectrometty (ESI-MS). 

Oligosaccharides and O-deacylated LPS were analyzed on a mass 
spectrometer with an electrospray ion source operating in the negative-ion mode. 
Oligosaccharide samples were dissolved in H 2 0 mixed with running solvent (1 /xl 
in 4 Ail), and injected into a stream of H 2 0/acetonitrile (1/1, v/v) containing 1% 
acetic acid, at a flow rate of approx. 20 /A/min. Mass calibration was carried out 
with Csl in the negative-ion mode. 
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In some cases, selected oligosaccharide fractions were analyzed at higher 
resolving power (M/A M = 2000) using a sector-orthogonal time of flight (TOF) 
instrument with an array detector operating under ESI conditions in the negative-ion 
mode. In this case, the solvent system and flow rate were essentially the same as 

5 described above for the quadrupole ESI experiments. A scan speed of 5 sec/decade 
was used for all samples over the m/z range of 500 to 3000 with an accelerating 
voltage of 4 kV and an ESI needle voltage of between 3.5-4 kV higher. Mass 
calibration was carried out with an external reference consisting of Csl taken under 
liquid secondary ion mass spectrometry conditions, followed by a one point 

10 correction of the doubly charged deprotonated molecular ion of the oligosaccharide 
from the LPS of Salmonella typhimurium Ra mutant ((M-2H) 2 - exact = 973.2)) in 
the negative-ion ES-MS mode. 

p. Matrix Assisted T. a qe.r Desorp tion Ionization (MAT DTI Mass SnectrometTV. 

1 5 O-Deacylated LPS samples were analyzed on a Voyager or an Elite 

MALDI-TOF instrument (PerSeptive Biosystems, Framingham, MA) equipped with 
a nitrogen laser (337 nM). All spectra were recorded in the negative-ion mode using 
delayed extraction conditions as described in detail elsewhere. (Gibson et al. J. 
Amer. Soc. Mass Spec. 8:645-658 (1997)). Samples were dissolved in H 2 0 (approx. 

20 250 pmol///l), and mixed 1 : 1 with the matrix solution (a saturated solution of 

2,5-dihydroxybenzoic acid in acetone) and allowed to dry at room temperature on a 
gold-plated MALDI plate. Approximately 100 laser shots were recorded for each 
sample, averaged and then mass calibrated using an external mass calibrant 
consisting of renin substrate tetradecapeptide, insulin chain B, oxidized, and bovine 

25 insulin (all from Sigma). For external calibrations under these conditions, a mass 
accuracy of 0.1% was obtained. For comparison purposes, a single point correction 
was made to the spectra of the O-deacylated LPS from PGEM using the expected 
lipid A fragment ion ((M-H) average = 952.009), and then the spectra for the three 
chimeric strains were recalibrated using this lipid A fragment ion and an additional 

30 ion from PGEM (m/z 2835.7) present in all four samples. 
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f Tandem Mass Spectrometry (MS/MS) Using Q uadm pole-TOF (qTOF). 

Dephosphorylated oligosaccharides were analyzed in the positive-ion mode 
on a mass spectrometer equipped with a nanospray ion source. The analyzer 
consists of a high pressure RF-only ion guide followed by a quadrupole mass filter. 
5 A high pressure quadrupole collision cell follows the first mass filter. The TOF 
mass analyzer is comprised of a reflection with an effective flight path of 2.5 meters. 
Samples were dissolved in H 2 0/acetonitrile (1/1, v/v) containing 1% acetic acid, and 
2 {A of each was injected into a nanospray tip. The nanospray needle voltage was 
typically 800-1000 V. One sample loading usually gave an analysis time of 30-40 

10 min, which allowed a conventional mass spectrum to be obtained prior to the 
selection of several individual ions for CID MS/MS. In MS mode the high 
resolution capability (8,000 FWHM) allowed unambiguous determination of the 
charge state for each ion. For CED-MS/MS operation the quadropole mass analyzer 
with a mass window of I m/z unit was used to select precursor ions for 

15 fragmentation, which in most cases were doubly charged (M+2H) 2+ . The selected 
ions were fragmented in a collision cell with air as the collision gas and analyzed in 
the orthogonal TOF operating at an accelerating potential of 20 kV. Fragment ion 
spectra were accumulated under computer control for periods of between 10 seconds 
and 1 minute. Mass assignments based on external calibration were generally 

20 within 50 ppm of calculated monoisotopic values whereas internal calibration gave 
masses accurate to +5 ppm. 

g SDS-PAGF, Analysis of TPS. 

We have previously reported the transformation of E. coli strain JM 109 (a 

25 K- 12 strain which produces rough LPS (r-LPS) which lack the O-side-chain) with a 
series of plasmids containing overlapping restriction fragments of DNA from the Isg 
region of H. influenzae type b strain A-2 ( Kwaik et al, 1990). Partial LOS 
segments were produced As diagrammed in Figure 1, the pGEMLOS-4 clone 
contains all of the complete orfs (orfs A-G) in the Isg region, whereas pGEMLOS-5 

30 contains orfs C-G, and pGEMLOS-7 contains orfs F-G. The clones pGEMLOS-4, 
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pGEMLOS-5, and pGEMLOS-7 were shown by SDS-PAGE to produce modified 
LPS structures which added 1.4, 1.0, and 0.4 kDa moieties, respectively, to the 4.1 
kDa E. coli core (Figure 2). 

5 h Analysis of O-Deacylated T PS hv MALDT-TOF. 

For initial screening of LPS molecular weights and heterogeneity by mass 
spectrometry, small aliquots of LPS from PGEM, pGEMLOS-4, pGEMLOS-5, and 
pGEMLOS-7 were treated with anhydrous hydrazine to remove O-linked fatty acids 
from the lipid A moiety. The PGEM O-deacylated LPS sample contains several 

10 species in the range of 2738-3172 Da, representing the major E. coli core structures. 
When fit to proposed compositions, the observed species were found to exhibit 
heterogeneity in heptose (Hep), hexose (Hex), 3-deoxy-D-mannooctulosonic acid 
(Kdo), phosphate (Phos), and phosphoethanolamine (PEA) (Table 2). Specifically, 
two main core types were observed containing either 3 Hex and 3 Hep (with 2 or 3 

1 5 Kdos) or 4 Hex and 4 Hep (with 2 Kdos), with variable amounts of phosphate and 
PEA in both The pGEMLOS-7 O-deacylated LPS mixture contained many of these 
same species, in addition to two major new species at (M-H)- 3334.5 and 3456.8. 
The m/z 3334.5 species apparently arises from the addition of Hex and 
N-acetylhexosamine (HexNAc) to the PGEM core structure containing 4 Hex, 4 

20 Hep, 2 Kdo, 2 Phos, and 1 O-deacylated diphosphorylated Lipid A (O-DPLA) 
moiety. A further addition of 1 PEA moiety gives the m/z 3456.8 species. These 
data suggest that the transformation producing pGEMLOS-7 results in the addition 
of a Hex-HexNAc moiety to the E. coli LPS. Likewise, the main species in the 
pGEMLOS-5 O-deacylated LPS (m/z 3700.6 and 3823.6) were found to arise from 

25 the addition of 2 Hex plus 2 HexNAc to the PGEM core structure containing 4 Hex, 
4 Hep, 2 Kdo, 2 Phos, 1 O-DPLA, and 0 or 1 PEA (see Table 2). These structures 
are also found in the pGEMLOS-4 O-deacylated LPS, in addition to new species 
arising from the further addition of either another Hex (m/z 4083.2 and 4206.4) or 
HexNAc (m/z 4124.5 and 4246.8) to, in this case, the PGEM core structure 

30 containing 4 Hex, 4 Hep, 3 Kdo, 2 Phos, 1 O-DPLA, and 0 or 1 PEA (see Table 2). 
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Of the chimeric LPS structures, only these high molecular weight pGEMLOS-4 
components contained the third Kdo moiety. 

i Analysis o f Oli gosacch aride by EST-MS and LSIMS. 

The LPS from PGEM, pGEMLOS-4, pGEMLOS-5, and pGEMLOS-7 were 
subjected to mild acid hydrolysis to liberate free oligosaccharides. Initially, small 
aliquots of the oligosaccharide fractions were desalted by size exclusion HPLC and 
analyzed as mixtures by negative-ion ESI-MS. The ESI-MS spectra contained 
predominantly doubly charged ions, (M-2H) 2 ". In general, the data were consistent 
with results from the MALDI-TOF analysis of O-deacylated LPS. The PGEM 
sample was found to contain seven major oligosaccharides and several minor 
species, ranging in molecular weight from 1459.3 to 2016.7 Da. As shown in Table 
3, proposed compositions were determined for the various species which indicated 
that the structures consisted of two main core types; one containing 3 Hex, 3 Hep, 
and lKdo, and another containing 4 Hex, 4 Hep, and lKdo. Variability in the 
number of phosphate and PEA groups was responsible for the large number of 
species present in the mixture. 

The pGEMLOS-4, pGEMLOS-5, and pGEMLOS-7 samples contained many 
of the species found in the PGEM sample, in addition to larger molecular weight 
oligosaccharides (Table 3). New LPS glycoforms of M, 2177.7 and 2302.5 were 
observed in the pGEMLOS-7 sample, consistent with the addition of a single Hex 
and HexNAc residue to the PGEM core structure containing 4 Hex, 4 Hep, 1 Kdo, 2 
Phos, and 0 or 1 PEA The high molecular weight components of the pGEMLOS-5 
sample (M, 2543.9 and 2666.5) suggested the further addition of yet another 
Hex-HexNAc unit, and the pGEMLOS-4 sample contained even higher molecular 
weight materials (ranging from Mr 2706.1 to 2870.0) consistent with the addition of 
one more Hex or HexNAc moiety. 

To aid in the determination of proposed compositions for these species, 
oligosaccharides from the PGEM, pGEMLOS-4, pGEMLOS-5, and pGEMLOS-7 
samples were separated by size exclusion chromatography and fractions were 
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analyzed by LSIMS and/or ESI-MS. Selected fractions representing the two major 
PGEM core types and the various chimeric structures were then pooled, 
dephosphorylated with aqueous HF, rechromatographed on size-exclusion HPLC, 
and analyzed again by negative-ion LSIMS or ESI-MS. Proposed compositions for 
5 the molecular ions observed after HF-treatment are listed in Table 4. Upon removal 
of phosphate and PEA moieties, the major high mass species present in the 
pGEMLOS-7 sample is an oligosaccharide of M r (avg.) 2020.3 (lHexNAc, 5 Hex, 4 
Hep, and lKdo). The pGEMLOS-5 sample contains an oligosaccharide of M r (avg.) 
2386.3, resulting from the further addition of 1 Hex and lHexNAc to the 
10 pGEMLOS-7 LPS (2 HexNAc, 6 Hex, 4 Hep, and 1 Kdo). This species is also 
present in the pGEMLOS-4 sample, in addition to higher molecular weight 
structures containing an additional Hex (M r (avg.) 2548.4) or HexNAc (M^ (avg.) 
2589.5). 

15 j. Monosaccharide Composition arid Linkage Analyses. 

Mass spectrometric analyses of the free oligosaccharides from PGEM, 
pGEMLOS4, pGEMLOS-5, and pGEMLOS-7 indicated that the different chimeric 
structures arise from additions of stoichiometric amounts of hexose and HexNAc 
residues to a variably phosphorylated PGEM core structure containing 4 Hex, 4 

20 Hep, and lKdo. No chimeric structures were observed to contain the 3 Hex, 3 Hep, 
and lKdo core. 

For comparison purposes, dephosphorylated oligosaccharide fractions 
containing the two PGEM core types and the main chimeric structures from 
pGEMLOS-4, pGEMLOS-5, and pGEMLOS-7 were hydrolyzed in 2N 

25 trifluoroacetic acid to determine their monosaccharide compositions, and therefore 
the identities of the Hex and HexNAc residues. When analyzed by high pH anion 
exchange chromatography with pulsed amperometric detection, the PGEM 
hydrolysates were found to contain only galactose, glucose, and 
L-glycero-D-manno-heptose (Table 5). (The Kdo residue is not recovered under 

30 these hydrolysis conditions.) The two core types were identified as GalGlc 2 Hep 3 and 
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GalGc 3 Hep 4 . The pGEMLOS-7 sample contained GlcNH 2 Gal 2 Glc 3 Hep 4 (Table 5), 
suggesting that the larger PGEM core was being modified by the addition of one Gal 
and one N-acetylglucosamine (GlcNAc) residue. Likewise, the composition of the 
pGEMLOS-5 sample suggested that the larger PGEM core was being further 

5 glycosylated with only Gal and GlcNAc residues. Fraction 2 from pGEMLOS-4, 
which contained the same species as pGEMLOS-5, gave similar results, and fraction 
1 from pGEMLOS-4, which contains three main species (see Table 4), contained 
slightly more GlcNH2. 

Aliquots of the same six dephosphorylated oligosaccharide fractions used for 

10 monosaccharide composition analysis were taken for methylation analysis to 

establish sugar linkage positions. The partially methylated alditol acetates observed 
by GC/MS are listed in Table 6. Again, by comparing the two PGEM core types, it 
is relatively straightforward to see that the second terminal heptose of the larger 
PGEM core is converted to a 1,7- linked heptose in all of the chimeric structures and 

1 5 thus must represent the linkage site for the novel glycosylatioiL Since no chimeric 
structures were observed with the Hep 3 core, it is most likely that the nonreducing 
terminal heptose recently identified on the oligosaccharide branch in the K- 12 core 
structure is the modified terminal heptose. Additionally, no new trilinked 
saccharides were obtained from the chimeric oligosaccharides, suggesting that the 

20 sugars were most likely all added in a straight chain. 

k. Sequencing of Chimeric Oligosaccharides bv MS/MS. 

To confirm the identity of the linkage site between the E. coli LPS core and 
the novel oligosaccharide moieties, and to determine the sequences of the added 

25 sugars, the dephosphorylated oligosaccharides were subjected to MS/MS analysis. 
For these experiments, samples were run in the positive-ion mode and doubly 
charged molecular ions, (M+2H) 2+ , were selected for collision-induced dissociation 
(CID). Various reducing-terminal (Y-type) and non-reducing terminal (B-type) 
sequence ions are present in the spectra For the PGEM oligosaccharide, the Y ion 

30 series including the Y 6a , (m/z 732.2 (2+)), Y a5 , (m/z 65 1 2 (2+)), and Y 4a , (m/z 
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1 139.3) fragment ions, and the corresponding B ion series including the B 3a> (m/z 
517.2), B 4a , (m/z 841.3), B 5 (m/z 1225.4), and B 6 (m/z 1417.4) fragment ions, 
support the published structure with the fourth heptose on the non-reducing terminus 
of the largest oligosaccharide branch. In addition to these sequence ions, several 
ions present in the spectrum apparently arise from internal cleavages, which can 
occur under high energy CID conditions. In the spectrum of the pGEMLOS-5 
oligosaccharide, two similar Y and B-type ion series clearly define the sequence and 
linkage site of the added tetrasaccharide. Intense B ions at m/z 366.1 (B 2a ,') and 
731.3 (B 4a ,) arise from the sequential cleavage of two Hex-HexNAc moieties. 
These losses are also represented by the corresponding Y 9a ,' (m/z 2020.6) and 7a , 
(m/z 1655.5) fragment ions. Fragment ions at m/z 923.3 (B 5ct ,)0 and m/z 1463.4 (Y 
6 „,) confirm that the Hex-HexNAc-Hex-HexNAc moiety is linked to a heptose, and 
additional cleavages further along the large oligosaccharide branch confirm that the 
novel tetrasaccharide is attached to the largest branch of the PGEM core structure. 

In the MS/MS spectra of the chimeric oligosaccharides from pGEMLOS-7 
and pGEMLOS-4, intense B ions also clearly defined the structures of the added 
sugar moieties. In the pGEMLOS-7 oligosaccharide (M r 2019.7), a B ion at m/z 
366.1 corresponds to a single Hex-HexNAc disaccharide moiety. The pGEMLOS-4 
oligosaccharide of M r 2587.9 (HexNAc 3 Hex 6 Hep 4 Kdo) lost a 
HexNAc-Hex-HexNAc fragment (m/z 569.2) and a 
HexNAc-Hex-HexNAc-Hex-HexNAc fragment (m/z 934.3), whereas the 
pGEMLOS-4 oligosaccharide of M, 2546.8 (HexNAc 2 Hex 7 Hep 4 -do) lost a 
Hex-Hex-HexNAc (m/z 528.2) and a Hex-Hex-HexNAc-Hex-HexNAc (m/z 893.3,) 
fragment. In addition to those B-type ions, the latter spectrum also contained large 
ions at m/z 366.1 and 731.3, which apparently arise as internal fragments in that 
case. 

Assuming that the oligosaccharides are built up sequentially, i.e., from - 
pGEMLOS-7 to pGEMLOS-5 to pGEMLOS-4, the MS/MS data, in combination 
with our methylation analysis results, allows the partial structures of the chimeric 
oligosaccharides to be deduced as shown in Figure 3. 
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The structural data support the prediction that E. coli K- 12 transformed with 
plasmids containing portions of an eight gene segment from H. influenzae involved 
in LOS biosynthesis makes chimeric LPS which can be modified to produce 
oligosaccharide essentially identical to that of H. influenzae. Moreover, we have 
shown that the chimeric LPS are segregated hybrid-type structures, where the E. coli 
R-LPS core structure is first synthesized and then serves as a scaffold for H. 
influenzae LOS biosynthesis enzymes to add a second independent set of sugars not 
found in the parent E. coli strain. Thus, the biosynthetic pathways appear to be 
sequential (segregated) and not intermixed. 

Before this invention was made, the role of the terminal branch heptose in 
the E. coli R-LPS as the acceptor for oligosaccharide elongation or the requirement 
for a funcitonal initiator enzyme was unknown. The published structure for the 
complete E. coli K- 12 core did not contain a second terminal heptose, but rather 
had this fourth heptose as part of the inner core region. The oligosaccharide branch 
was believed to terminate in glucose, which was proposed to be the acceptor site for 
O-antigen and other substituents. The role of the initiator enzyme was unknown. It 
is now apparent that only E. coli R-LPS structures containing this fourth heptose 
(i.e., complete core structures) underwent elongation in the plasmid-transformed 
chimeric strains and thus, only those E. coli having this composition are useful as 
production cells for the production of H. influenzae. In the chimeric structures, 
GlcNAc is the first sugar added to the seven position of this heptose. There are two 
possible explanations for this crucial first step in the elongation sequence. One, an 
N-acetylglucosamine-specific glycosyltransferase from Haemophilus encoded in 
orff or orfg either has this precise specificity or is promiscuous enough to allow this 
reaction to occur. Two, some analogous E. coli glycosyltransferase gene is being 
activated by a Haemophilus regulatory gene. Since sequence comparisons of the 
seven genes contained in this plasmid suggest that both glycosyltransferase and 
regulatory genes are present, both explanations are possible. However, the fact that 
terminal GlcNAc and 1,7- linked heptose have been found in non-stoichiometric 
amounts in some other strains of E. coli K- 12 suggests that the addition of this 
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sugar to the terminus of the oligosaccharide branch is accomplished by E. coli 
enzymes. It was recently reported that when mutations causing the rough phenotype 
in E. coli K- 12 are complemented, the complemented strains produce an O-antigen 
which has GIcNAc at the reducing terminus of the repeat unit. Regardless of the 
5 mechanism of this first key step in the extension of the PGEM core, it appears that 
addition of this GIcNAc is rate-limiting, since a large percentage of unmodified 
PGEM core R-LPS remains in the chimeric mixtures. Furthermore, very little or no 
intermediate structures are observed as one progresses from pGEMLOS-7 to 
pGEMLOS-5 and pGEMLOS-4, suggesting that once this first GIcNAc is added, the 
1 0 addition of the other Haemophilus-velated sugars proceeds quickly to defined end 
points. If other steps in the biosynthesis of the chimeric LPS were as incomplete as 
the addition of this first GIcNAc, one would expect to see these other intermediate 
structures, yet none were observed. Therefore, it is likely that a N- 
acetylglucosaminyltransferase from E. coli that is regulated through the product of 
1 5 orff or orfg adds this first key sugar in the chimeric structures. Preliminary data on 
a chimeric construct containing orfg alone showed a mass shift of 203 Da 
(HexNAc), suggesting that orfg encodes this regulatory gene. 

The second step in the biosynthesis of the chimeric LPS is the addition of 
galactose to the 3-position of the terminal GIcNAc. The resulting disaccharide, 
20 Gall3 GIcNAc, is the structural moiety observed in pGEMLOS-7, which arises when 
the transforming plasmid contains orfs F-G from Haemophilus. Examination of the 
predicted amino acid sequences of the gene products indicates that orff has high 
homology (66% identity) to a galactosyltransferase (asmE) from Erwinia 
amylovora, suggesting that it may encode a galactosyltransferase in Haemophilus. 
25 OrfG does not show any homology to known oligosaccharide biosynthetic genes, 
but is homologous (64% identity) to a gene encoding the ModE protein in E. coli. 
This protein is involved in molybdenum transport and regulation of transcription, 
suggesting that orfG may encode a regulatory protein from Haemophilus (which 
may be regulating an N-acetylglucosaminyltransferase gene from E. coli in the 
30 chimeric strains). 
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In the pGEMLOS-5 strain, an additional three genes are contained in the 
transforming plasmid (orfs C-G) and an additional GlcNAc and Gal are observed in 
the resulting LPS. These sugars now define the tetrasaccharide 
Gall-4GIcNAc-3Gall3GlcNAc. The LPS from this transformant is now reactive to 

5 the 3F1 1 MAb, suggesting that this new disaccharide is betalinked to form the 
terminal trisaccharide, Gall4GlcNAcl-3Gal. All of the new orfs contained in this 
plasmid have some homology with known glycosyltransferase genes: orfC has 
homology with the asmD (26% identity) from Erwinia amylovora, which encodes a 
glycosyltransferase for exopolysaccharide synthesis, and TrsD (38% identity) from 

10 Yersina entercolitica, a gene involved in LPS inner core synthesis, orfD has 
homology with the sialyltransferase gene (1st) (27% identity) from Neisseria 
gonorrhoeae, and orfE has homology with a putative glycosyltransferase gene (77% 
identity) from Actinobacillus sp. and the galactosyltransferase gene, amsB (27% 
identity) from Erwinia amylovora. The fact that these three additional orfs in the 

1 5 transforming plasmid apparently result in the addition of only two more sugars to 
the growing oligosaccharide chain may indicate that the acceptor for one of the 
glycosyltransferases is absent in the chimeric LPS. 

When two more orfs are added in the transforming plasmid (orfs A-G) to 
form the pGEMLOS-4 chimeric strain, we observe that the 3F1 1 epitope disappears 

20 and the terminal Gal residue of the epitope is capped by either a second Gal or a 
GlcNAc moiety, apparently linked to the 6-position of the Gal. These new species 
present in the pGEMLOS-4 LPS population were also observed to contain a third 
Kdo moiety, presumably somewhere in their core regions. While some of the 
incomplete core structures found in the wild-type E. coli K- 12 LPS populations also 

25 contain a third Kdo, of the chimeric structures, only the structures unique to 
pGEMLOS-4 were found to contain a third Kdo. This chimeric strain was also 
recognized by MAb 6E4 which recognizes an inner core, Kdo-related epitope in K 
influenzae, suggesting that this third Kdo forms a different epitope than the one 
found in the core structure of the wild-type E. coli LPS. Thus, the addition of orfs A 

30 and B to the transforming plasmid which fon-ned strain pGEMLOS-4 seems to have 
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multiple effects on the chimeric LPS structure. OrfB is homologous (46% identity) 
to the sialyltransferase gene from K gonnorhea. OrfA is homologous to both the 
Rfb X gene product (22 % identity) from E. coli and TrsA (24 % identity) of 7. 
entercolitica. These are putative O-antigen transporters (36,37), suggesting that 
5 orfA may encode a flippase. 

While sialyl-N-acetyllactosamine-containing structures are only minor 
components of the wild-type H. in fluenzae type b strain A2 LOS population, we 
have previously seen that Isg genes are involved in the synthesis of this epitope. 
Transposon mutagenesis of orfD produced mutant strain 281.25, that lost all ability 

10 to add galactose to Hib LOS glycoforms. This strain could not make any of the 
wild-type LOS structures larger than the major species containing four glucoses and 
three heptoses. Mutation of orfE (which is downstream of orfD) produced strain 
276.4 which had essentially the same defect, except for one important difference: 
strain 276.4 retained the ability to make the sialyl-N-acetyllactosamine epitope. 

1 5 These results suggest that in the transposon mutants, the knockout of orfD has a 
polar effect on orfE, which would imply that the gene product of orfE is a 
galactosyltransferase required for synthesis of the higher molecular weight wild-type 
structures containing terminal galactose(s) on their glucose disaccharide branches 
and the gene product of orfD is likely an A^-acetylglucosaminyltransferase required 

20 for the synthesis of the sialyl-N-acetyllactosamine epitope. The case for these 
assignments can be made on the basis of the homologies noted above (orfE is 
homologous to a galactosyltransferase gene) and the LOS glycoforms observed in 
the 276.4 and 281.25 mutant strains. Since no truncated versions of the sialyl-N- 
acetyllactosamine structure were seen in the 276.4 LOS population (i.e.; no species 

25 lacking either sialic acid or sialic acid plus galactose), it seems probable that the 
orfd gene codes for the glycosyltransferase which adds the GlcNAc to the 
oligosaccharide branch. This is also consistent with the observation that one of the 
genes in orfd C-E is apparently responsible for adding GlcNAc to the 3-position of 
the Gal which is terminal in the pGEMLOS-7 LPS structure. 

30 This chimeric carbohydrate expression system has provided information that 
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is relevant to unraveling the functions of these Isg genes and has the additional 
advantage of being carried out in the absence of the normal endogenous genetic 
background on H. influenzae. Indeed, while gene knockouts of some of the Isg 
genes in H. influenzae have been completed, downstream or regulatory gene effects 

5 can often complicate their functional analysis. In this E. coli expression system, 
structural analysis of the resulting chimeric LPS has shown that synthesis proceeded 
as a serial (non-parallel) synthesis, that is, the new elements of the chimeric LPS 
were added after the formation of the E. coli R-LPS. The fact that this synthesis was 
sequential (rather than interdigitated with the R-LPS synthesis, for example) 

1 0 allowed for the functions of these H. influenzae gene products to be more readily 
delineated from the chimeric oligosaccharide structures. Moreover, screening of the 
chimeric LPS products with monoclonal antibodies enabled us to follow the 
formation of terminal sugar sequences (epitopes) that are unique to the Haemophilus 
strain from which the plasmid DNA originated. 

1 5 All publications and patents cited herein are incorporated by reference as 

though fully set forth. 

This invention has been described with respect to specific examples and 
embodiments. However, it is understood that one skilled in the art may make 
variations or modifications that are within the spirit and scope of the invention. 
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Table 1. Bacterial Strains, LPS and Vectors 



Strain/ 
Plasmid 

E. coli 

JM109 

H. influenzae 
A2 

Plasmid 
pGEM3Zf+ 



Relevant characteristics 



recA, supE, hsR, 
NJac-pro) 



Parental strain 



Reference/source 



(40) 



(10) 



Promega Biotech 



pGEMLOS-4 Ap R , contains 7.4 kb bamHI-pstl This study 

DNA K influenzae Isg locus 

pGEMLOS-5 Ap R , contains 5.5 kb hindlll-pstl This study 

DNA H. influenzae Isg locus 

pGEMLOS-7 Ap R , contains 2.% kb sphl-pstl This study 

DNA H. influenzae Isg locus 



LPS 
pGEM 

pGEMLOS-4 
pGEMLOS-5 
pGEMLOS-7 



isolated from strain JM 1 09 This study 

transformed with pGEM3zf+ 

isolated from strain JM 1 09 This study 

transformed with the plasmid 
pGEMLOS-4 

isolated from strain JM 1 09 This study 

transformed with the plasmid 
pGEMLOS-5 

isolated from strain JM 1 09 This study 

transformed with the plasmid 
pGEMLOS-7 
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Table 4. Molecular weights (Af r ) and proposed compositions of the dephosphorylated 
oligosaccharides from pGEM, pGEMLOS-7, pGEMLOS-5, and pGEMLOS-4 fl 
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a Observed molecular weights are reported as average mass values. 



Table 5. Monosaccharide compositions of the dephosphorylated oligosaccharide fractions" 
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a Molar ratios were derived from comparison to the hydrolysate of the 5. typhimurium Ra 
oligosaccharide of known composition, and then those values were normalized to either 3.0 or 
4.0 heptoses in each fraction. 
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Table 6. Methylation anai, of the dephosphorylated oligosacc. -fide fractions' 1 
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a Peak areas were measured from the GC/MS EI total ion chromatograms, and values were 
normalized to the 1,3,6-glc residue. The data for pGEMLOS-7 are the average of two runs. 
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WHAT IS CLAIMED IS: 



1 . A process for the production of a complex carbohydrate which comprises the 
steps of: 

(a) inoculating transformed production cells into a culture medium 
capable of supporting the growth of said production cells wherein said production 
cells are prepared by transforming bacteria comprising (i) a core lipid structure 
containing a terminal heptose molecule and (ii) an enzyme capable of adding an 
acceptor molecule to said heptose molecule by inserting an isolated DNA sequence 
encoding glycotransferase synthesizes a complex carbohydrate into said bacteria to 
yield transformed production cells; 

(b) allowing growth of said transformed production cells; and 

(c) recovering said complex carbohydrate from the culture medium. 

2. The transformed production cell of claim 1 . 

3. The transformed production cell of claim 2 comprising a gram-negative 
bacterium having a terminal heptose on a kdo core and having inserted an isolated 
DNA sequence encoding the glycotransferase catalyzing the synthesis of an 
oligosaccharide of Haemophilus influenzae. 

4. The transformed production cell of claim 3 comprising Escherichia coli K- 
12 strain JM 109. 

5. The process of claim 1 wherein the bacteria are gram-negative bacteria. 

6. The process of claim 1 wherein the bacteria is Escherichia coli K-12 strain 
JM 109. 
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7. The process of claim 1 wherein the acceptor molecule is [N-acetyl] 
galactose. 

8. The process of claim 1 wherein the isolated DNA sequence encodes a 
functional Haemophilus influenzae glycotransferase. 

9. The process of claim 1 wherein the isolated DNA sequence encodes a 
functional Neisseria gonorrhoeae glycotransferase. 

10. The complex carbohydrate made according to the process of claim 1 . 

11. A process for the production of an oligosaccharide which comprises the steps 
of: 

(a) transforming gram-negative bacteria comprising (i) a core lipid 
structure containing a terminal heptose and (ii) an enzyme that adds a galactose 
molecule to said heptose wherein said transformed gram-negative bacteria are 
prepared by constructing a vector comprising an isolated DNA sequence coding for 
a glycotransferase that synthesizes an oligosaccharide; 

(b) inoculating said transformed gram-negative bacteria into a 
culture medium capable of supporting the growth of said 
transformed bacteria; 

(c) allowing growth of said inoculated gram-negative bacteria; and 

(d) recovering said oligosaccharide from the culture medium. 

12. The process of claim 1 1 wherein the transformed bacteria is Escherichia coli 
K-12 transformed with an isolated DNA sequence from Haemophilus influenzae. 

1 3 . The oligosaccharide made by the process of claim 1 1 . 
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14. The process of claim 1 1 wherein the transformed bacteria is Escherichia coli 
K-12 transformed with an isolated DNA sequence from Neisseria gonorrhoeae. 

1 5 . The oligosaccharide made by the process of claim 14. 

1 6. A process for the production of a complex carbohydrate, comprising 
culturing production cells comprising a chimeric DNA sequence encoding a 
glycotransferase so as to yield production cells comprising an altered level of 
complex carbohydrate, wherein the production cells are bacteria comprising a core 
lipid structure containing a terminal heptose molecule and encoding an enzyme 
capable of adding an acceptor molecule to the heptose molecule. 

17. The process of claim 16 further comprising recovering the complex 
carbohydrate. 
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ABSTRACT OF THE INVENTION 



Compositions and methods for making complex carbohydrates in a bacterial 
production cell are disclosed. The complex carbohydrates that can be made include 
oligosaccharides and polysaccharides of bacterial or mammalian origin. 
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ftlief are believed to be true; and further that these statements were made with the knowledge that willful false statements and the like so 
SSSJSS^ fine or imprisonment, or both, under Section 1001 of Title 18 of the Umted States Code and that such willful false 
Statements may jeopardize the validity of the application or any patent issued thereon. 



iPuil Name of joint inventor number 1 : Michael A, Apicella 
Cgitizenship: United States of America 

fSpost Office Address: 2646 Johnsons Crossing, N.E. 

= Solon, IA 52333 



Residence: Solon, IA 



Signature: 



Date: 



Michael A. Apicella 



Full Name of joint inventor number 2 : Bradford W, Gifoson 

Citizenship: United States of America Residence: Berkeley, CA 

Post Office Address: 1324 Peralta Avenue 

Berkeley, CA 94702 



c . . Date: 

Signature: 

Bradford W. Gibson 



X Additional inventors are being named on separately numbered sheets, attached 
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I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and 
belief are believed to be true; and further that these statements were made with the knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that such willful false 
statements may jeopardize the validity of the application or any patent issued thereon. 

Full Name of joint inventor number 3 : Nancy J« Phillips 

Citizenship: United States of America Residence: Oakland, CA 

Post Office Address: 60 1 8 Colby Street 

Oakland, CA 94618 



Signature : „ ■ Date : 

Nancy J. Phillips 



Full Name of inventor: 

Citizenship: Residence: 
Post Office Address: 



Signature: Date: 



full Name of inventor: 

gtizenship: Residence: 
Post Office Address: 



•u. ^ . Date: 

^Signature: — 



Full Name of inventor: 

Citizenship: Residence: 
Post Office Address: 



Signature: 



Date: 



t t 
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§ 1.56 Duty to disclose information material to patentability. 

(a) A patent by its very nature is affected with a public interest. The public interest is best served, and the most effective patent 
examination occurs when, at the time an application is being examined, the Office is aware of and evaluates the teachings of all information 
material to patentability. Each individual associated with the filing and prosecution of a patent application has a duty of candor and good 
faith in dealing with the Office, which includes a duty to disclose to the Office all information known to that individual to be material to 
patentability as defined in this section. The duty to disclose information exists with respect to each pending claim until the claim is canceled 
or withdrawn from consideration, or the application becomes abandoned. Information material to the patentability of a claim that is 
canceled or withdrawn from consideration need not be submitted if the information is not material to the patentability of any claim 
remaining under consideration in the application. There is no duty to submit information which is not material to the patentability of any 
existing claim The duty to disclose all information known to be material to patentability is deemed to be satisfied if all information known 
to be material to patentability of any claim issued in a patent was cited by the Office or submitted to the Office in the manner prescribed by 
§§ 1 97(b)-(d) and 1.98. However, no patent will be granted on an application in connection with which fraud on the Office was practiced 
or attempted or the duty of disclosure was violated through bad faith or intentional misconduct. The Office encourages applicants to 
carefully examine: 

(1) prior art cited in search reports of a foreign patent office in a counterpart application, and 

(2) the closest information over which individuals associated with the filing or prosecution of a patent application believe any 
pending claim patentably defines, to make sure that any material information contained therein is disclosed to the Office. 

2(b) Under this section, information is material to patentability when it is not cumulative to information already of record or being 
made of record in the application, and 

(l) It establishes, by itself or in combination with other information, a prima facie case of unpatentability of a claim; or 
£ (2) It refutes, or is inconsistent with, a position the applicant takes in: 

Q (i) Opposing an argument of unpatentability relied on by the Office, or 

0 (ii) Asserting an argument of patentability. 

1 prima facie case of unpatentability is established when the information compels a conclusion that a claim is unpatentable ™te the 

m Serance of evidence, burden-of-proof standard, giving each term in the claim its broadest reasonable consteuction consisted with the 
Sfore any consideration is given to evidence which may be submitted in an attempt to estabhsh a contrary conclusion of 

l^tentability. 

(c) Individuals associated with the filing or prosecution of a patent application within the meaning of this section are: 

(1) Each inventor named in the application: 

(2) Each attorney or agent who prepares or prosecutes the application; and 

(2) Every other person who is substantively involved in the preparation or prosecution of the application and who is 

associated with the inventor, with the assignee or with anyone to whom there is an obligation to assign the application. 

(d) Individuals other than the attorney, agent or inventor may comply with this section by disclosing information to the attorney, 
agent, or inventor. 



